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Chlamydomonas reinhardtii chloroplast Sulfate Permease (SulP) gene structure 
intron I 

5' UTR intron n_ intron EI intron IV 3' UTR 
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FIG. 3 

reinhardtii chloroplast Sulfate Permease (SulP) amino acid sequence 

MERVCSHQLASSRGRPCIAGVQRSPIRLGTSSVAHVQVSPAGLGRYQRQRLQWA SAAAA 
AAFDPPGGVSAGFSQPQQQLPQQHPRQPQAVAEVAVAESVSAPASAAPSNDGSPTASMDG 
GPSSGLSAVPAAATATDLiFSAAARLRLPNIiS P I ITWTFMLS YMAFML IMP I TAIjLQKAS L» 
V P LNVF IARATEP VAMHAYYVT F S CS L I AAA IN CVFGFVLAWVLVR YN FAGKK I LD AAVD 
LPFALPTSVAGLTLATVYGDEFFIGQFLQAQGVQWFTRLGWIAMIFVSFPFVWTMQP 
VMQEIQKEMEEAAWSLGASQWRTFTDWLPPLLPALLTGTALAFSRALGEFGSIVIVSSN 
FAFKDLIAPVLIFQCLEQYDYVGATVIGTVLL^ (SEQ ID NO:l) 
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Coding sequence of CrcpSulP 

5' UTR:173 bp, Exonl: 124 bp, intronl: 77 bp, Exon2: 78 bp, 

intronll: 279 bp Exon3: 620 bp, intronlll: 834 bp, 

Exon4: 87 bp, intronlV: 699 bp, Exon5: 327 bp, 3'UTR: 575 bp 

Total length: 3873 bp 

gcttagtacc taagcaaaaa taccaaagcc ttatcctgag ttgtcaacaa gaactccagc 60 
ctgcgacgat gcaaagcctt tcttgagcgg gttgatggac tttgctttgt tatctgtcca 120 
gtaagccacc agacactacc aagtagagta atccatttgt ataggtacag aatatggagc 180 
gagtttgcag ccatcagctt gcctcgtcgc gagggaggcc atgcatcgct ggggtgcagc 240 
ggtcgcccat ccgactaggg acttcaagcg ttgctcatgt gcaggtctct ccggcaggta 300 
agcaccgcgc tcggcggcgt gtacacatgg ggccgtcagg ccaactgcgt ttgttggcta 360 
tgcaaccgaa acaggccttg ggagatatca acggcaaaga ctgcaagtcg tggcgtctgc 420 
agctgcggca gcggctttcg accctcctgg aggtgcgtgg cgtgagggct gcacgggtgc 480 
gggttggcct ggaaaccaag cctcgccacg actacctgca acagcattgc ccgcatctcc 540 
agcccctcac cctcgagtgc ctcccgaaga cctctatccc ctgcgcatca ttggttcggg 600 
ggcgccgcct gcgggccttg ggcgctggct acgctgaccg cacggcacga cttggcacgg 660 
cctggcgcgg cctgagcggc cccccccctc ctgatggccc cacgctttgc cgcccacgcc 720 
gctccccgca ggtgtctccg ccgggttctc gcagccgcaa cagcagctgc cacaacagca 780 
cccacgccaa ccacaggcgg tggcggaggt agctgtcgcc gagtcagrtct cggcgcccgc 840 
ttctgcggcg ccctccaatg atggctcgcc cacggcctcc atggacggcg gccccagctc 900 
pggcctcagc gccgtgcccg ccgccgccac cgccaccgac ctcttctccg ccgcggcgcg 960 
cctccgcctg cccaacctct cccccatcat cacctggacc ttcatgctct cctacatggc 1020 
cttcatgctc atcatgccca tcaccgcgct gctgcaaaaa gcctcgctcg tgccgctcaa 1080 
cgtcttcatc gcgcgcgcca ccgagccggt ggcgatgcac gcctactacg tcaccttctc 1140 
ctgctcgctg atcgcggccg ccatcaactg cg t gbttggc ttcgtgctgg cctgggtgct 1200 
ggtgcgctac aatttcgcgg ggaagaagat cctggacgcg gcggtggacc tgccgttcgc 1260 
gctgccgacc tcggtggcgg gcctcacgct tgccacggtg tacggcgacg agttcttcat 1320 
cggccagttc ctgcaggcgc agggcgtgca ggtgcgtgcg tatagcatag tggagtgtgg 1380 
ttagcagctg ggggtccggc agtagttccc gccctagtga ggrtcgaaact ataccagaag 1440 
aagaggacga acatggggct atccagcaag ctcgtctagg gaaggaggag tttgggagaa 1500 
cggtggggtg ggagggagag ggagggcgtt ggctgggagg gaagggtaag gcgggaggga 1560 
gatggtagca cggggcgttg gggacgcaga aggatgacag gcggctgcag ggaagggatg 1620 
gggaagcgga gctggggaca gtgcgaagag ccgggagaga ggggaagttt gagtcaggaa 1680 
gaggggctag agaggggcat gcggactcct gctgggattt aggtgcgtgc tcattgagga 1740 
gcccttggaa tcagcggacg gaaacgtggc cgacggggtc tgccgagcac accaggctag 1800 
ctagacgcgc ggttgggcaa cgagcagagc tgctgtgcgg ctatggatgg aaggcgatgc 1860 
agcgagcatg tgcagtgaac attggtttga ggacagggga ctccgaggtt gcataggcgg 1920 
gccgccactg tctctgccgc tagggtgact agctgcctcg aacctggcgg tggccccata 1980 
cccgcagbtg gaggatgctc cacgcgcttc agcttgccat gtctggggtc tgggtctgga 2040 
cgcaatcagc gtgtgagggt ccaactctat atggaattat ggataccttc caactaccag 2100 
cacgtaggct gccggaacgc ggctgaagcg gctggcctgc cccctcatcc tctcgttccc 2160 
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ctgtttttgt cccctgtcca cccaggtggt grttcacgcgg ctgggtg~tgg tgatcgccat 2220 
gatcttcgtg fcccttcccct tcgtggtgcg caeca tgcag cccgtcatgc aggtgagagc 2280 
geccaggagg eggagecatg gcgggttggg gcgggttggg gcgggttggg gcggggcgcg 2340 
gatggggegg cttggggagt aatgtggggc ggatggggtg gcagcctggc agggtatggg 2400 
agegagagga tageggggae aggggacagg gaagggaagg gaaggggaag gatgecctat 2460 
gcgagcaaag ggggtatggg aaceggeggt tggggctggg agegaeggga gcagggaggg 2520 
agtgcacgga aegggggcaa ggeggacagg gtgagggagg gtgcaggccg gactgggatg 2580 
ggtcatgtg t cctggtcggg ggtgtagccg tgggaggegg geaggcageg tgtgttctgg 2640 
cacggtgttt tggegaaaga taccacggca tggtatgggg ccagttgggc agggaagaac 2700 
cgttggacac gacttcgttg acagatctag ttcattgeae ccgggtcgca ccaagggtgg 2760 
cggcgagccc ggcccggcac gtccgagtac cccggagccg taacgccgca acccgccttg 2820 
ttgcgcccct tccctgctcc cctgctccgc atacegtgea ccatgccctc tgccgccccc 2880 
tcaggccctc aggccctcac ctccccctca cctcctccta acgccttccc ctcgccttcc 2940 
cttcccctcc caacgccacc aegtgeaaca ggaaatccaa aaggagatgg aggaggegge 3000 
atggtcgctg ggcgcctcgc agtggcgcac cttcacagac gtggtgctgc cgccgctgct 3060 
gcccgcgctg ctgaccggca cggcactggc cttctcgcgc gcgcttggcg agttcggatc 3120 
cattgtcatc gtgtcctcca actttgeett caaggacctg atcgcgcccg tgetgatett 3180 
ccagtgcctg gagcagtacg actacgtggg cgccaccgtg ateggcacag tactgctgtt 3240 
gatttegctg gtgatgatgt tggcggtgaa ccagctgcag aagctggcgc gcaagtgagg 3300 
ggctgaggcg tttgaggaga gtgggcgtct geggaggege ttgtggcgca ggggcaggtg 3360 
gaggaggttg cagggtgagg caggagtggc aggtggtgga gggtgcaggg cgggg tgb tg 3420 
ggatgggatg ggatgggacc gtgggagggg tgggactttg ggtgggtggg agtgggtgct 3480 
aegtattagg atatgggagg tggta tgcag ttgaaggggg gggtggcaat ctggacgggg 3540 
actcactgtt tactaggcac geatgtcgea ggagtggata tcgatgggtg tggggatgtc 3600 
ageaegcttg gcttgagttg ggccatggga cccgggacta ggcttggttg cgagccgagc 3660 
cagtcaccag ggagaegtae gagcgcacac agtgattacg gggattgatt aggeggegaa 3720 
ttgacgcaaa tecaeggggg ctgtggcttg ggggaggcag ggattgagcg aaggaegcac 3780 
tgeaagctea ggcagtcgea tgcccgtacc ctgcttctgg tccagtgtgg agacaagact 3840 
ggcaatcgtg gtcctttgea attcatggcg cgc (SEQ ID NO: 2) 
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Full length cDNA sequence of CrcpSulP: 1984 bp 

gcttagtacc taagcaaaaa taccaaagcc ttatcctgag ttgtcaacaa gaactccagc 60 
ctgcgacgat gcaaagcctt tctbgagcgg gttgatggac tttgctttgt. tatctgtcca 120 
gtaagccacc agacactacc aagtagagta atccatttgt ataggtacag aatatggagc 180 
gagtttgcag ccatcagctt gcctcgtcgc gagggaggcc atgcatcgct ggggtgcagc 240 
ggtcgcccat ccgactaggg acttcaagcg ttgctcatgt gcaggtctct ccggcaggcc 300 
ttgggagata tcaacggcaa agactgcaag tcgtggcgtc tgcagctgcg gcagcggctt 360 
tcgaccctcc *bggaggtgtc tccgccgggt tctcgcagcc gcaacagcag ctgccacaac 420 
agcacccacg ccaaccacag gcggtggcgg aggtagctgt cgccgagtca gtctcggcgc 480 
ccgcttctgc ggcgccctcc aatgatggct cgcccacggc ctccatggac ggcggcccca 540 
gctccggcct cagcgccgtg cccgccgccg ccaccgccac cgacctcttc tccgccgcgg 600 
cgcgcctccg cctgcccaac ctctccccca tcatcacctg gaccttcatg etctcctaca 660 
tggccttcat gctcatcatg cccatcaccg cgctgctgca aaaagcctcg ctcgtgccgc 720 
tcaacgtctt catcgcgcgc gccaccgagc cggtggcgat gcacgcctac tacgtcacct 780 
tctcctgctc gctgatcgcg gccgccatca actgcgtgtt tggcttcgtg ctggcctggg 840 
tgctggtgcg ctacaatttc gcggggaaga agatcctgga cgcggcggtg gacctgccgt 900 
tcgcgctgcc gacctcggtg gcgggcctca cgcttgccac gy tg Lacggc gacgagttct 960 
tcatcggcca gttcctgcag gcgcagggcg tgcaggtggt gttcacgcgg ctggg t g t gg 1020 
tgatcgccat gatcttcgtg tccttcccct tcgtggtgcg caeca tgcag cccgtcatgc 1080 
aggaaatcca aaaggagatg gaggaggegg catggtcget gggcgcctcg cagtggcgca 1140 
ccttcacaga cgtggtgctg ccgccgctgc tgcccgcgct gctgaccggc aeggcactgg 1200 
cettctcgcg cgcgcttggc gagtteggat ccattgtcat cgtgtcctcc aactttgect 1260 
tcaaggacct gatcgcgccc gtgetgatet tccagtgcct ggagcagtac gactacgtgg 1320 
gcgccaccgt gateggcaca gtactgctgt tgatttcget ggtgatgatg ttggcggtga 1380 
accagctgca gaagctggcg cgcaagtgag gggctgaggc gtttgaggag agtgggcgtc 1440 
tgcggaggcg cttgtggcgc aggggcaggt ggaggaggtt gcagggtgag gcaggagtgg 1500 
caggtggtgg agggtgeagg gcggggtgtt gggatgggat gggatgggac cgtgggaggg 1560 
gtgggacttt gggtgggtgg gagtgggtgc tacgtattag gatatgggag gtggtatgca 1620 
gttgaagggg ggggtggcaa tctggacggg gactcactgt ttactaggca cgcatgtcgc 1680 
aggagtggat atcgatgggt gtggggatgt cagcacgctt ggcttgagtt gggccatggg 1740 
acccgggact aggcttggtt gegagecgag ccagteacca gggagacgta cgagcgcaca 1800 
cagtgattac ggggattgat taggeggega attgaegcaa atecaegggg gctgtggctt 1860 
gggggaggca gggattgagc gaaggacgea ctgcaagctc aggcagtege atgcccgtac 1920 
cctgcttctg gtccagtgtg gagacaagac tggcaatcgt ggtcctttgc aattcatggc 1980 
gege — i-984 

(SEQ ID NO: 3) 
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Green Algae 

(Visible light: 400-700 nm) 
Biomass & H 2 -production 
[2.5 mLH 2 L-1 h-1] 



Photosynthetic Bacteria 

(Infrared: 700-1000 nm) 
Biomass & H 2 -production 
[40-60 mL H 2 L-1 h-1] 



Anaerobic Bacteria 

Fermentation and 
H 2 -production 
[25-55 mLH 2 L-1 h-1] 



CATTCAATTTGCAGCGTTCCTAAAATGGCAAGCACAACGCTGCTCCAGCCCGCGCTTGGTCTGCCCTCGCGGGTAGGG 
CCTCGCTCCCCTCTGTCGCTTCCCAAAATTCCTCGCGTGTGCACGCACACTAGTGCTCCCTCTACCTCAAAGTACTGC 
GACTCATCATCAGTTATAGAGAGCACGCTAGGGCGGCAAACATCGGTTGCCGGGAGACCATGGCTTGCACCCCGGCCT 
GCGCCTCAACAAAGCCGAGGCGACCTACTGGTCTCCAAAT CGGGGGCAGCAGGAGGCATGGGCGCCCATGGAGGGGGC 
TTAGGGGAACCGGTCGATAATTGGAT CAAGAAGCTACTCGTTGGTGTCGCGGCGGCGTACATCGGCTTGGTCGTGCTG 
GTGCCCTTCCTGAATGTCTTCGTCCAGGCGTTCGCCAAGGGCATCATTCCCTTCCTGGAG CACTGCGCGGACCCGGAC 
TTTCTGCACGCACTCAAGATGACGCTGATGCTGGCGTTCGTGACGGTGCCGCTCAACACGGTGTTTGGCACGGTGGCC 
GCGATCAACCTCACGCGCAACGAGTTCCCCGGCAAGGTGTTCCTGATGTCGCTGCTGGACCTGCCCTTCTCCATCTCG 
CCCGTGGTGACTGGCCTGATGCTCACGCTGCTGTACGGCCGCACCGGCTGGTTCGCGGCGCTGCTGCGGGAGACCGGC 
ATCAACGTGGTGTTCGCATTCACGGGCATGGCCCTGGCCACCATGTTTGTGACGCTGCCGTTCGTGGTGCGCGAGCTG 
ATCCCCATCCTGGAGAACATGGACCTGTCGCAGGAGGAGGCGGCGAGAACGCTGGGGGCCAACGACTGGCAGGTGTTC 
TGGAACGTGACGCTGCCCAACATCCGCTGGGGCCTGCTGTACGGCGTGATCCTGTGCAACGCCCGAGCCATGGGCGAG 
TTCGGAGCCGTGTCCGTCATCTCGGGCAACATCATCGGCCGCACGCAGACGCTGACGCTGTTCGTCGAGTCCGCCTAC 
AAGGAGTACAACACGGAGGCGGCGTTCGCGGCGGCTGTGCTGCTGAGCGCGCTGGCGCTGGGCACCCTGTGGATCAAG 
GACAAGGTGGAGGAGGCGGCGGCGGCGGAGAGCCGCAAG TAGA GAGGAGCAGGCGGCGTCGGCAGCGGCGGCAGTGGC 
AGCGGCAGCGGCGGAGAGCGGCAGCTGGAGAGGAGCAGGCGGTGGCGGCGGAGCGGCGGAAATAGAGAGGTGCAGCAA 
GGAGGCAGGCGCCGACGCGAGGGGAGGGCGTGGTGGTGGGCTTGCGTGGGTGCTTGGTCCGTGGCCAGGGTGCCTGGC 
CTGGGTAGTTGGTGTGTGGGTGAAGCTGATTCCTGTTTGGGTGAGGCGGCCGAGTTCCTGAAGGAAGCAAGGAAGGAC 
AGTGCCGCAGTGACCAGCGGGTAATGGTAAGGGAGCTGACACGTGTGGCGTTCTGTTGCTGGTCGCCGCATGCTTAAC 
GCAGCGGGAGCAGCTTCTCTGTCTGATGTCTAACGGGGGCGTTGTATGCTGATAATAGACGGAGGGCGAAGGGAGCAG 
GCGCGGTTCAGATGGGGTAAAAGCTGTTGGAAATCAACACGTGCAGCGGGTGGGTTGCATTTGTGATCACTGGACGTT 
CTGAGTGGTCCGTGCGCCTATAGCGCGTGCTGTGCATATATACGCGCGCCGGCGCATAAAACATGACTGCATGTGTCG 
GTGTTGACGGTACAGTTATGCCGTGCCCCGTTTTACAAGCGGGATAGAGGCACACTCCACGTAGTATGCATTGAGCCC 
AGTAGACTCTGGTCAGAAGGCCGGTAAATTTACATGTGTCGTGGTGAACCCTGTAAGTCATGGCCCAAG 
(SEQ ID NO: 04) 
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GTACTTCAATTGTCAGAATGGCGTCGCTGCTCGCTCAAACAACATCGCGCCTTGGCGCTCGCCCAGCTGCGCAA 

GCTGGCCCTGTCGCCCAAATGGCACCGATGGCAAGCCGAGTGCAGCCGGCGATGCCTAGCGCGCTGCTCCCACT 

GCACGCCAGAGCGACAACAACTTCAGTCGCTTGCCGGGCAGCCAGCATCGACAAACCTGTCGTTTACACTCCTC 

GAGATTCGTCGCAACAGTCCTCCAATGGGGCAGGAGAAGTGTCCATGTCCATATCATCCATGGACGAGGTTGGA 

CCCTCTTATGAGGGAATCATTACAGACGCGCCTACACGACCAACGGGGCTTTATGTGCGGGTGCGCAACATGGT 

GAAGCACTTCAGCACCGCCAAAGGCCTGTTCAGGGCGGTGGACGGCGTGGACGTGGACATCGAGCCCAGCTCCA 

TCGTGGCGCTGCTGGGGCC CAGCGGCAGC GGCAAGAC CACATTGCTGCG CCTCATTGC AGGCCTGG AGCAGCC C 

ACGGGCGGCAACATCTACTTTGACGACACGGACGCGACCAACCTGTCCGTCCAGGACCGCCAGATCGGCTTCGT 

GTTCCAGAGCTATGCGCTGTTCAACCACAAGACAGTTGCGGAGAACATCAAGTTTGGACTGGAGGTGCGCAAGC 

TCAACATCGACCACGACAAGCGCGTGGCGGAGCTGCTGGCGCTGGTGCAGCTCACCGGCCTGGGCGACCGCTAC 

CCGCGCCAACTGTCGGGCGGCCAGCGGCAGCGTGTGGCGCTGGCGCGCGCCCTGGCCTCCAACCCGCGGCTGCT 

GCTGCTGGACGAGCCCTTTGGCGCGCTGGACGCGGTGGTGCGCAAGCAGCTGCGCACGGGGCTGCGCGAGATCG 

TGCGCAGCGTGGGCGTGACCACCATCATTGTGACGCACGACCAGGAGGAGGCGTTCGACCTGGCGGACAAGGTG 

GTGGTGTTCAACAGGGGCCTGGTGGAGCAGCAGGGCAGCCCCACCGAGATCATCAAGCGGCCGCGCACGCCCTT 

CATTATGAAGTTCGTGGGGGAGACCAACGTGGTGCCGGCCACGTCGCTGCTGGCCAAGCGCATGCGCTTCAACA 

CCTCCAAGACCAGCGTCATGTTCCGGCCG CACGACATTAAGCTGTTCAAGACGGTGCCGCCGGAGAGCGGCGAG 

GGCGCGCTGACCACGGTGGGCGCCAACGTGGCGGACAAAGCCAACCTGGGCTGGGTGGTCAAGTACACGCTGCG 

CTTCGATGACGACGTGGAGTGCGAGCTGCAGCTCAGCCGCGACCAGGACGAGCGCGAGTACAACCTGGTGGTGG 

GCAGCCGCGTGTTCGTGCACGTGCCGCACCGCACCATGATGGGCTTCAACGCCAGCGACGTGGACAGCACGCCC 

ATCGTGTAATGTGCGGGGTTGGCGGCTGTGGCCAGCGATTGTTGCAATGCAGTCCAGCGTGCTCTTGGTTTGGT 

TCCAGTGACACCCATCCAGGGCACAGGTCCCTGAGCAGCGGGTGTTGGTGATGGGTTGGAGCAGTTGTACCCGA 

TTCTCGCATGCAAGGGGGCGGGGCGCCCACGGGGTGGGAGAGCGGAATGGCGGTGAGGTGGGCTACTGCATGCG 

GCCGTGGAGGAACGGAGGGGTGCACAGGCGGGCAGGTAGACAGGCGGAGCGGGCTGGGTGAGCGGGGCTGTAGT 

TTGGGGGTGGAGGCCGTGCAGACTGGTTGGGATACTGACAGATCAATGAGCGGCGTCTGCTCCATGGGTCAGTA 

GGAGAGCGGTGTGGGTGTGTGCAGTTGCGAGTTCTGGAGCGTTGTGCGCCTCGCGCTGTGTGCGCGCGCCCGTG 

CGTCTGCGGGCGCTGTCGGAGACGGGCGATGTACATGAAGCTGGACCTGGGCCTGTCTCACAAATATCCCTTAT 

GTTAATAGTAGGATGTCGCAATCGTGCCTTGGAGCCCACCTGATGTGTGTGTCACAGGTGGCAGTAGTTTGGCC 

TTGCGGGAGGTAGCACGTCTTTCATGAGAGTGCGTGTGCGTGACCGCTTTTACATTGCCAATCACGCTGGAAGG 

TGAAACCATGCATCATGCGTGCTATCAGGAGATGCAGACGGCGGATTGCTGCCAAAATGTTCTGTTGTTGGTGT 

GCAGACTTGGTGGCGAAGGGGCCAGGCGCCCAGGGGTATGCTGCGTGCCAAGGAGCTGCTGCCGCCACGAGTGA 

CCAGCGAAACTTGTAAATTGAATATTGTATCCT (SEQ ID NO: 05) 
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GGGCAGCGTATAAGTA ATG TCGTTCTTGGCTCCCAGCT TAGGCGTCGCGCGGGGGATTCTGGAGCCGGCGAGTGC 

AGCGAGGCCGCCTGCGCACGCGGCCGGTCACGCACCCGTTCTAACAAGCGATAGGACTGGTGGACCTGCCGCTAA 

TCATGACAGGCCTGCCGGTGCTCCCAGCCCCCATGCGGCGTCGTTGACGCCCTCCAGCAGCGGGCAAGCAAGCCA 

GCAAGGCGACCCCCAGCGCTCGCAGCACCAGCAAGCGCAGCGCCAGGACCAGCAGCAGTCGCAGTCGCGGTCGCT 

CCAATCACACCTCATCACCGCGGCCACGCTGCTGCCAGCCCTGCCGCCTCCGCCTCCCGGCGGCAACGGCGACGG 

CGATGGCGGCGAAGCTGCGGGGCCGCAGCCGCTCGCGGACGTCGCGGCTCAGCCGCCGGAGGTTGTGCTGACGCT 

GGCGTCGTTCGCGGTGAC CAAGCTGG CGTACGTGCGTGTGACGCGCGCGTTCCGGGAGTGGTACGAGCGCACGAA 

GGGCGTGGATGTGCGCTTCCGCCTCACCTTCGCCGCCAGTGGCGTG CAGGCCCGCGCCGTGATCGATGGCCTGCC 

CGCCGACATCGTGGCCCTGGCGCTGCCTCTGGACCTGGACAAGATCGTGTCGGCGGGGCTGATCCGGCCCGACTG 

GCGCAGCGCCTACCCGGCAGCCAGCGTGGTGTGCGAGACCACCGTGGCGTTCGTGGTGCGCCAGGGCAACCCCAA 

GAACATCCG CACCTGGGAGGACCTCACGCGGGCGGGTGTGGAGGTGGTGCTGGCCAACCCCAAGACCGCCGGAGT 

GGCCAGGTGGATCTTCCTGGCCCTGTGGGGCGCCAAGATGAAGAAGGGCAACGCCGCCGCGCTGGCGTATGTGCA 

GCGCGTGTTCGAGAACGTGGTGGTGCAGCCGCGTGATGCGCGCGAGGCGTCGGACGTGTTCTATAAGCAGAAGGT 

GGGCGACGTGCTGTTGACGTACGAGAACGAGGTGATCCTGACCAACGAGGTGTACGGCGACAAGGCGCTGCCGTA 

CCTGGTGCCCTCCTACAACATCCGCATCGAGTGCCCGCTGGCGCTGGTGGACAAGGTGGTGGATGCCCGCGGCCC- 

CGAGGTGCGCGAGGCGGCGTCCGAGTTCTGCCGTTTCCTGTTCACGCCCGCGGCGCAGCACGAGTTCGCGCGGCT 

GGGCTTCCGCGTGAACCCGCGCACCTGCAAGGAGGTGGCGGCGCAGCAGACCGGACTGCCGCCCGCAAACCTGTG 

GCAGGTGGACAAGGAGCTGGGCGGCTGGGCTGCGGCCCAGAAGAAGTTTTTCGACGCTGGCGCCATCCTTGACGA 

CATCCAGTCCGCCGTGGGCAAGCTGCGTGTGGAGCAGCGCAAGGCGGCGCAGGCGGCGGCCAGGCGGTAGAGAGA 

CGCGGTACAAGTGCTCGGGTGCTCAGCAGGAGCTGCAG CAGGGGCAGCAAGAGGGCCTTGACAGGAGGGAATGGT 

AGGCAAAGGCGGCAGGGGAGGCGGGATGGCGGGATGAAGTGAGGGTGTGCAAGCAGCGATGTGTGCCAAGGACGG 

TGTCGGCGATGTACATGATAACATGAGGAGACAGGAGCATCTCCTGGCAGGAGGCGGCAACCGTGGAGTGTCTGA 

AAGGAGAACTTGATTGCTCAGTGTGGGACAGATAACGGAGGGCGGGGTGTGGGGCGTGGGGCTTATCGGTGTGCT 

TCTATGGGGAGGCCTGACTGCATTGGGGGCGACGTAGTGTGATGGCCGCTACACGCTTGCTCGGAACTGACATAA 

AGAGGCGTTCAGGCCATGGCTGCATGAGGCTTGATGTCGTATCGCGGACTGTC (SEQ ID NO: 06) 
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MASTTLLQPALGLPSRVGPRSPLSLPKIPRVCTHTSAPSTSKYCDSSSVIESTLGRQTSV 
AGRPWLAPRPAPQQSRGDLLVS KS GAAGGMGAHGGGLGE P VDNW I KKLL VGVAAAY I GLV 
VLVPFLNVFVQAFAKGI I PFLEHCADPDFLHALKMTLMLAFVTVPLNTVFGTVAAINLTR 
NEFPGKVFLMSLLDLPFSISPWTGLMLTLLYGRTGWFAALLRETGINWFAFTGI^LAT 
MFVTLPFWRELIPILENMDLSQEEAARTLGANDWQVFWIWTLPNIRWGLLYGVILCNAR 
ANGEFGAVSVI SGNI IGRTQTLTLFVESAYKEYNTEAAFAAAVLLSALALGTLWIKDKVE 
EAAAAESRK* (SEQ ID NO : 07) 
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MASLLAQTTSRLGARPAAQAGPVAQI^PMASR 

RAAS I DKP WYTPRDS SQQS SNGAGEVSMS I S SMDEVGPS YEGI I TDAPTRPTGL 
YVRVRNMVKHFSTAKGLFRAVDGVDVDI EPSS I VALLGPSGSGKTTLLRLI AGLE 
QPTGGNIYFDDTDATNLSVQDRQIGFVFQSYALFNHKTVAENIKFGLEVRKLNID 
HDKRVAELLALVQLTGLGDRYPRQLSGGQRQRVALARALiASNPRLLLLDEPFGAL 
DAWRKQLRTGLRE I VRSVGVTT 1 1 VTHDQE E AFDLAD KVWFNRGL VEQQGS PT 
EIIKRPRTPFI.MKFVGETNWPATSLLAKRMRFNTSKTSVMFRPHDIKLFKTVPP 
ESGEGALTTVGANVADKANLGWVVKYTLRFDDDVECELQLSRDQDEREYNLVXGS 
RVFVHVPHRTMMGFNASDVDSTPIV* (SEQ ID NO: 08) 
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MSFLAPSLGVARGILEPASAARPPAHAAGHAPVLTSDRTGGPAANHDRPAGAPSPH 

AASLTPSSSGQASQQGDPQRSQHQQAQRQDQQQSQSRSLQSHLITAATLLPALPPPP 

PGGNGDGDGGEAAGPQPLADVAAQPPEVVLTLASFAVTKLAYVRVTRAFREWYE 

RTKGVDVRFRLTFAASGVQARAVIDGLPADIVALALPLDLDKIVSAGLIRPDWRSA 

YPAASVVGETTVAFVVRQGNPKNIRTWEDLTRAGVEVVLANPKTAGVARWIFLAL 

WGAKMKKGNAAALAYVQRVFENVVVQPRDAREASDVFYKQKVGDVLLTYENEV 

ILTNEVYGDKALPYLVPSYNIRIECPLALVDKVVDARGPEVREAASEFCRFLFTPAA 

QHEFARLGFRVNPRTCKEVAAQQTGLPPANLWQVDKELGGWAAAQKKFFDAGAI 

LDDIQSAVGKLRVEQRKAAQAAARR* (SEQ ID NO: 09) 
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Chloroplast Sulfate Transport System 




